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Reduction and deletion processes occur regularly in conversational speech. A segment 
that is affected by such reduction and deletion processes in many Germanic languages 
(e.g., Dutch, English, German) is A/. There are similarities concerning the factors that 
influence the likelihood of final /t/ to get deleted, such as segmental context. However, 
speakers of different languages differ with respect to the acoustic cues they leave in the 
speech signal when they delete final /t/. German speakers usually lengthen a preceding 
/s/ when they delete final AA This article investigates to what extent German listeners are 
able to reconstruct A/ when they are presented with fragments of words where final A/ has 
been deleted. It aims also at investigating whether the strategies that are used by German 
depend on the length of /s/, and therefore whether listeners are using language-specific 
cues. Results of a forced-choice segment detection task suggest that listeners are able 
to reconstruct deleted final A/ in about 45% of the times. The length of /s/ plays some 
role in the reconstruction, however, it does not explain the behavior of German listeners 
completely. 
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INTRODUCTION 

In normal conversational speech situations, speakers seem to be 
rather "careless." One of the most striking results of this careless 
speech is that speakers often reduce words (e.g., Ernestus, 2000; 
Johnson, 2004; Zimmerer, 2009). Reductions are rather "min- 
imal" when the number of segments remains unchanged. For 
instance, segments can be assimilated as in German Senf ("mus- 
tard") which may be produced as [zenjf] instead of [zsnf] (e.g., 
Zimmerer et al., 2009). Segments can be lenited, for example, 
medial Itl may be produced as (reduced) flap in American English 
(e.g., Kiparsky, 1979; Patterson and Connine, 2001; Connine, 
2004; Fukaya and Byrd, 2005; Tucker, 2007; Warner and Tucker, 
2011 and references therein). However, reduction processes can 
alter the pronunciation of words more dramatically. For instance, 
segments can be deleted, which might even have an impact on the 
syllabic structure of words (e.g., Johnson, 2004; Zimmerer, 2009). 

Generally, it seems that listeners are unimpressed by reduc- 
tions (including deletions) in normal listening conditions despite 
the high amount of reductions that speakers produce. It is, how- 
ever, not yet understood, how they deal with these reductions 
and deletions, which may be due to the use of ideal, unreduced 
stimuli (so called "laboratory speech") in many perception exper- 
iments. Although there is a better control over what listeners hear 
in these circumstances, experiments using exclusively laboratory 
speech might not tell us how listeners deal with reductions occur- 
ring in natural speech. This article aims to better understand the 
processes that lead to the ease of perception of reduced speech. 

A number of studies that used reduced items in perception 
experiments showed that reduced words seem to be harder to 
process by listeners, especially if the sentential context is not 
present (e.g., Pickett and Pollack, 1963; Pollack and Pickett, 1963; 



Ernestus et al., 2002; Ernestus and Baayen, 2007; Zimmerer, 2009; 
van de Ven et al., 2011; Zimmerer et al., 2012), which appears to 
be somewhat contradicting the observation that listeners usually 
fair well in recognizing what has been said. One possible expla- 
nation for the apparent ease of perception of reduced speech is 
that (at least some of the) segments are reconstructed during the 
course of perception, due to fine phonetic detail in the input (e.g., 
Manuel, 1991, 1995; Hawkins, 2003; Niebuhr and Kohler, 2011). 
This article investigates the reconstruction of a single deleted seg- 
ment, namely the alveolar voiceless stop It/ in word-final position 
in German 1 . 

Reconstruction of deleted segments has been shown for several 
sounds. For instance, Manuel (1991) showed that in words with a 
deleted Schwa (e.g., a word like support sounding like sport), fine 
phonetic cues were used by listeners to differentiate the two words 
(reduced support and sport, Manuel, 1991). Similarly, Manuel 
found evidence that listeners were able to reconstruct deleted /57 
in nasal contexts, based on fine phonetic detail in synthetically 
created stimuli. Fine phonetic detail has also been shown to be 
important for the perception of more massive reductions (e.g., 
Hawkins, 2003; Niebuhr and Kohler, 201 1). 

In the literature, language-specific and cross-linguistic tenden- 
cies have been identified, both with respect to the production as 



Note, that we call the process {Itl) deletion, despite some evidence that lis- 
teners produce some phonetic cues that could be interpreted as remnants of 
Itl. For us, items that were transcribed as /t/-less in the corpora we used in 
this investigation are called deleted. This is done to differentiate this process 
from reductions like flapping that is more regular. Furthermore, we apply this 
strategy also to other cases from the literature (e.g., Manuel, 1991) which are 
sometimes called "seemingly" deleted. 
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well as the perception of deleted segments. For instance, the seg- 
ment Itl (especially in final position) is very likely to be reduced in 
Germanic languages (see e.g., Guy, 1980; Neu, 1980; Sumner and 
Samuel, 2005; Mitterer and Ernestus, 2006; Raymond et al., 2006; 
Zimmerer, 2009). Concerning this deletion process, researchers 
have identified many aspects that are very similar across a number 
of languages as well including aspects of the context, in which Itl 
occurs (see next section). However, there are also aspects in pro- 
duction and perception that appear to be language-specific, for 
instance the role of fine phonetic detail. When speakers leave these 
cues to signal listeners a deletion in the speech signal, and thereby 
help them reconstruct segments, they arguably differ across lan- 
guages, because the phonetic realization of segments (in case of 
Itl: the amount of aspiration, closure duration, or voicing pat- 
terns) and consequently reduction processes are quite different 
cross-linguistically. 

HI DELETION IN PRODUCTION 

The voiceless coronal stop Itl has been studied intensively. One 
of the reasons is that the segment is relatively frequent addition- 
ally, it is deleted quite regularly in many (Germanic) languages 
(e.g., Guy, 1980, 1991; Neu, 1980; Mitterer and Ernestus, 2006; 
Raymond et al, 2006; Zimmerer et al, 2011, 2014) 2 . 

Cross-linguistically, (phonological) context proved to be one 
of the most important factors that can influence the deletion of 
Itl (Mitterer and Ernestus, 2006; Raymond et al, 2006; Zimmerer 
et al, 2011, 2014; Mitterer and Tuinman, 2012). Mitterer and 
Ernestus (2006), for instance, investigated final Itl deletion by 
analyzing two different corpora of spoken Dutch. These corpora 
differed with respect to both the speech register that was recorded, 
as well as the vocabulary that had been used. Their results showed 
that the likelihood of Itl being deleted in Dutch was highest when 
preceded by the fricative Is/ and followed by bilabial sounds. 
Zimmerer et al. (2011, 2014) investigated the deletion of Itl with 
help of two newly created corpora that were obtained with a verb 
paradigm production method which ensured that final Itl was 
(at least in one condition) always preceded by Isl, and followed 
either by Is/, by Ivl or by a vowel. They found overall deletion 
rates of 20% in the first corpus (Zimmerer et al., 2011) and 27% 
(Zimmerer et al., 2014) in /-st/ contexts of the second corpus. 
Concerning context to the right, deletion rates were highest when 
the Itl was followed by Isl and lowest when it was followed by 
a vowel. Deletion rates were intermediate when final Itl was fol- 
lowed by the fricative Ivl. These results were stable across the two 
corpora. Similar results were reported by Mitterer and Tuinman 
(2012) who investigated final Itl deletion by native Dutch speakers 
and German speakers of Dutch. The context for final Itl was either 
a preceding In/ or a preceding Isl, and Itl deletion was most likely 
when preceded by Is/. However, they found different patterns for 
verbs and nouns in the two language groups. While Dutch speak- 
ers showed the same pattern for verbs and nouns, that is, higher 



2 Regularity in this case refers to the fact that the deletions occur in noteworthy 
percentage of the possible cases. This does not necessarily indicate a phono- 
logical regularity. Note, however, that the notion of variable rules has been 
used to describe the process (e.g., Guy, 1980; see also Cedergen and Sankoff 
1974). 



deletion rates after Is/ than after In/, German speakers behaved 
(slightly) differently. Deletion rates were higher when the Itl was 
preceded by In/ for verbs produced by German speakers, but with 
nouns they behaved like Dutch speakers, having more deletions 
after Isl than after In/. Overall, the speakers produced deletion 
rates of 25% for nouns and 40% for verbs. 

Across different studies, other linguistic and extra-linguistic 
factors have also been identified as influencing the amount of Itl 
deletion, such as speaking rate (more deletion in faster speech, 
cf., Guy, 1980, 1991 see also Byrd and Tan, 1996), speech reg- 
ister (more deletions in less formal speech, cf. Mitterer and 
Ernestus, 2006), fluency (less deletions in non-fluent speech sec- 
tions, cf., Raymond et al., 2006; Zimmerer et al., 2011, 2014), 
social class and dialectal differences (cf., Labov, 1966; Wolfram, 
1967), speaker age (tendency for more deletions by younger 
speakers, cf., Guy, 1991), word category (more deletions in func- 
tion words than content words, cf., Neu, 1980), relative frequency 
(the likelihood of Itl to be deleted was correlated with its like- 
lihood to be decomposed in words like daftly and swiftly, cf, 
Hay, 2003). Speaker gender was not consistently found to have an 
impact on the amount of Itl deletion [a tendency for more dele- 
tions by male speakers compared to female speakers, was found, 
for instance by Wolfram, 1967; Neu, 1980, but not found by 
Raymond et al., 2006 — analyzing the Buckeye Corpus (Pitt et al., 
2007); 3 or Zimmerer et al., 2014]. 

The factors that have been found to influence Itl deletion 
in different investigations of different Germanic languages seem 
to point to cross-linguistic similarities, such as segmental con- 
text. These similarities concern mainly factors that influence the 
amount or likelihood of Itl deletion. However, the studies reported 
above also reveal some language-specific differences. These differ- 
ences mainly concern the actual realization of items where the Itl 
was deleted. For Dutch, for instance, Mitterer and Ernestus found 
that speakers kept a preceding Isl rather short when they deleted 
Itl. This short Isl could be interpreted as a cue to a deleted Itl, 
because in final consonant clusters, such as /st/, the Isl was pro- 
duced shorter than if it was a single segment in final position 
in Dutch (Mitterer and Ernestus, 2006). Interestingly, German 
speakers used the opposite strategy. When final Itl in an /-st/ 
cluster was deleted, German speakers tended to lengthen the pre- 
ceding Isl (Zimmerer et al, 2011, 2014). The difference between 
Dutch and German speakers also raises interesting questions con- 
cerning the perception or restoration of deleted Itl which will be 
discussed in the next section. 

Itl DELETION IN PERCEPTION 

There have been several studies addressing the perception of 
naturally occurring variants of Itl, including its reduction and 
deletion. For instance, Sumner and Samuel (2005) investigated 
released and globalized variants of final Itl and their possible 
role as being represented in long-term memory. When listeners 
encounter variants of Itl, one could argue that instead of recon- 
structing the segment, they could also have stored these variants 
directly. While in their short-term priming experiments all vari- 
ants activated the correct lexical entry equally well, the results 



3 They also investigated deletion of medial Id/. 
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from a long-term priming experiment suggested, that only canon- 
ically produced variants (with a full Itl) are stored in long-term 
memory. These findings indicate that variants with reduced Itl 
can be handled well by listeners, but the results not lend sup- 
port for a direct storage of these variants in long-term memory 
(Sumner and Samuel, 2005). Furthermore, even if we assume that 
variants of deleted Itl were stored in memory, these items could 
interfere with other words that actually do not have the Itl in 
their canonical form. For instance, when the German word hau-st 
("hit — 2nd PERS. SG.") is produced without final Itl, the resulting 
word will be Haus ("house"). This means that listeners would still 
benefit greatly from strategies to reconstruct deleted Itl even if (at 
least some of) the variants were stored in the mental lexicon. 

The perception of reduced Itl was also investigated with 
its flapped variant. Results were mixed: Some studies showed 
that flapped variants were perceived (measured as amount of 
lexical activation) as well as not-flapped variants in American 
English (Luce and McLennan, 2005; McLennan et al., 2005), other 
researchers showed that the reduced flap (e.g., unreduced flaps 
[pAf 1] as opposed to reduced [pAf| ] ) was not as acceptable in per- 
ception than the unreduced flap (e.g., Tucker, 2007). A possible 
explanation for the difference between these studies is the status 
that has been assigned to the flap. Tucker assumed the flap ver- 
sion as canonical and only the reduced flap version as a reduced 
variant, McLennan and colleagues treated the flap as a reduced 
variant of an underlying Itl. 

Concerning deleted Itl, Mitterer and Ernestus (2006) inves- 
tigated the extent to which the (acoustic) patterns that were 
produced by speakers of Dutch — the relatively short Isl as a 
cue to an underlying final consonant cluster (see Itl deletion 
in production) — had an impact on perception of deleted Itl. 
They investigated in a perception study with resynthesized stim- 
uli whether Dutch listeners were able to use the cues of a short 
hi to reconstruct final Itl. Furthermore, Mitterer and Ernestus 
were interested whether there is a difference in Itl reconstruc- 
tion depending on the context, that is, whether in the Isl context, 
where Itl deletion occurs more often, Dutch listeners are more 
likely to reconstruct Itl than in the In/ context, which is not as 
prone to the deletion of Itl. The findings of Mitterer and Ernestus 
suggest that this is indeed the case. Dutch listeners are more likely 
to reconstruct Itl in the Isl context than in the In/ context (see also 
Mitterer and McQueen, 2009 for similar results), and the listeners 
seem to use the cue of Is/ shortness to reconstruct final Itl. This 
also suggests that listeners are well aware of production patterns 
and use this information in the perception of deleted Itl . 

In another study, Mitterer and Tuinman (2012) investigated 
the extent to which the cues left in the acoustic signal are 
language-specific for the reconstruction of final Itl with native 
Dutch participants and German learners of Dutch. For stimuli 
similar to the ones used in Mitterer and Ernestus (2006), listen- 
ers were most likely to perceive a word containing Itl the more 
evidence for Itl was present. Also, participants perceived more 
often a Itl, if a possible reconstruction created a word, and if the 
preceding context was Isl. However, there were also differences 
between German and Dutch listeners. German listeners were in 
some cases overgeneralizing in the reconstruction of Itl. Mitterer 
and Tuinman interpreted the difference in reconstruction rate as 



partly being conditioned by transfer from their German native 
language, where Itl deletion is overall less frequent, and where the 
cues for deleted Itl may be different. This raises the question of 
the use of language-specific cues for reconstruction of Itl, which 
will be addressed in this article. 

RESEARCH QUESTIONS 

The short overview over production and perception of deleted Itl 
suggests that there are both cross-linguistic tendencies, such as 
context of final Itl, and language-specific processes. This article 
aims to investigate to what extent perception and reconstruc- 
tion of Itl deletion in German is language specific. Two research 
questions are addressed with a forced-choice segment detection 
method: 

• To what extent are German listeners able to reconstruct seem- 
ingly deleted Itl 7 . In German, about 20% of final /t/s get deleted 
(e.g., Zimmerer, 2009; Zimmerer et al., 2011, 2014). This 
means that listeners are faced with the process of Itl deletion 
quite frequently. 

• The second question addresses the language-specific aspect of 
reconstruction. Dutch listeners have been shown to use the fine 
phonetic details, such as Isl length as cues for reconstruction 
(e.g., Mitterer and Ernestus, 2006; Mitterer et al, 2008). The 
findings of Zimmerer and colleagues suggest that the length cue 
is different in German compared to Dutch, and the results of 
Mitterer and Tuinman (2012) further indicate that perception 
might be different for German and Dutch listeners. Therefore, 
the question arises whether German listeners are aware of the 
Isl length cue and use it to reconstruct deleted Itl. 

MATERIALS AND METHODS 
PARTICIPANTS 

In all, 14 native (11 female) German speakers participated in the 
experiment. They were between 20 and 34 years old (mean 24.9) 
and were recruited at the Goethe-University, Frankfurt. They 
received payment in kind (cookies and coffee/tee/juice) for their 
participation. All of the participants spoke standard German and 
were born in Hesse, or Northrhine-Westphalia. All of them had 
lived for more than 5 years in Frankfurt area. None of the par- 
ticipants reported any hearing problems. They were naive with 
respect to the purpose of the study, and were told about the 
process of Itl reconstruction only after they completed the exper- 
iment. An experimental session lasted about 25min, including 
reading the instructions and the practice session. 

MATERIALS 

The basis for the experimental stimuli were utterances recorded 
for a verb paradigm production corpus (Zimmerer et al., 2014). 
The corpus consists of paradigm productions of 50 different 
German verb forms. Participants had produced paradigmatic cells 
of verbs, where a verb form was always preceded by its correct pro- 
noun. For instance, for the verb hauen ("hit-lNF"), a paradigmatic 
cell was "du hau st, sie haut, ihr haut, sie hauen' ("you-2nd PERS 
SG. hit, she hits, you-2nd PERS. PL. hit, they hit") — the context 
where Itl deletion can occur is the second person singular ending 
of the person/number suffix — st (e.g., du hau-st "you 2nd PERS. 
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SG. hit"), which is underlined. In some cases, 3rd person singu- 
lar renditions can also end in /st/, however, the morphological 
structure of these forms is different (e.g., er haus-t "he dwells"). 

All items used in the forced-choice experiment were either 
fragments of verbs or fragments of verbs that were followed by 
(fragments of) pronouns from this corpus. All items were excised 
with Praat (Boersma and Weenink, 2013). For deleted /t/ items, 
fragments were extracted from the paradigms that included the 
vowel of the verb stem, followed by /s/, and the subsequent word 
could begin in either [si] (from sie "she/they") or Nil (from wir — 
"we") 4 . This procedure ensured that the items did not sound 
word-like. Therefore, possible /t/-decisions for /t/-deleted items 
(henceforth 0t-items) were not based on lexical influence, where 
listeners could have reconstructed a Itl more often in order to 
create an existing word form in German (e.g., Ganong, 1980; 
Mitterer and Tuinman, 2012). 

In a pretest, 93 0t-items from the corpus were presented to 
5 listeners for transliteration, all were students of the institute 
of Phonetics in Frankfurt. They were asked to write down what 
they heard, and had 4 s to give a transcription for every item, 
which they could listen to only once. The instructions were kept 
very simple (i.e., "write down what you think you heard"), with- 
out mentioning the segment /t/ or any reduction process or the 
fact that they were excised from verb paradigm productions. For 
the experiment reported here, 30 items which were transliterated 
without Itl were used (i.e., an item counted as being transliterated 
by all listeners without Itl when it was written with neither "t" nor 
"z") 5 . This procedure ensured that items were used which were 
rated /t/-less when no close attention was paid to the segment Itl. 

For the experiment, a total of 180 stimuli were excised from 
this verb paradigm production corpus (Table 1 gives an overview 
over the segmental make-up of the stimuli). Overall, 90 of these 
items had a Itl present (+t-items). These were 45 stimuli with a 
Itl (t-items) intervocalically, such as [arts], a fragment based on 
braten ("fry-lNF"), and further 45 stimuli which had Itl occur- 
ring in an Is/ context, that is, a /ts/ (which also counted as a Itl) 
like in [i:stsi] from ". . . fl iehst, sie ..." ("flee 2nd PERS. SG., she") 
(ts-items). Note that the affricate Its/ is a phoneme of German 



In a canonical standard German production, the pronoun sie ("she") should 
be produced as [zi:]. However, in conversational speech, the initial Izl is very 
often devoiced (cf. the Kiel Corpus of spontaneous speech, IPDS, 1994). Only 
[s] was produced in the items that were used for the experiment. 
5 In German orthography the letter "z" denotes the affricate [ts]. 



which is written "z" in orthography (see also footnote 5). The seg- 
ment sequence [ts] in the ts-items could possibly also interpreted 
as an affricate, which could have repercussions for the results (see 
section Results and Discussion and Conclusions). However, these 
items were included in order to have [s] present in all experi- 
mental conditions and not only in the —t and 0r conditions. 
Furthermore, the inclusion of these items is very close to the seg- 
mental and syllabic structure of the Qt-items. Then there were 
60 items without an underlying Itl (—t-items). These were 30 
stimuli which had a fricative hi or Izl preceded and followed 
by a vowel (s-items) such as [i:ss] excised from fl iefie n ("flow- 
INF") and 30 stimuli which had another consonant (e.g., In/), 
intervocalically (n-items) such as [ana] which was part of b arme n 
("ban-lNF"). Finally, the third group (30 stimuli) had the Itl 
deleted (Qt-items). One example is a fragment from the paradigm 
where in ". . . fl iehst, e r ..." ("flee 2nd PERS. SG., he") the Itl was 
deleted and the fragment was [i:se] . By definition, in the 0t-items, 
Itl was deleted word finally and followed by either a consonant 
(Ivl or Is/), or by a vowel if the deletion lead to a sequence of two 
/ss/ where no boundaries between the segments could be estab- 
lished which were treated as one single Is/. As can be seen in 
Table 1, the number of +t-items is higher than the number of 
—t-items. Because the 0t-items were transcribed without "t" in 
the pretest, and we did not know to what extent German listen- 
ers would be reconstructing "t," we counted these as instances of 
t-less items as well. This led to 90 items with "t" and 90 items 
without "t." 

The 180 items used for the experiment were produced by 
ten speakers of the verb paradigm production corpus mentioned 
above (Zimmerer et al., 2014). Individual speakers contributed 
between 11 and 22 items for the experiment. These speakers 
were also students at the Goethe-University who spoke standard 
German and had spent at least 5 years in Frankfurt. None of the 
speakers participated in the perception experiment. 

DESIGN AND PROCEDURE 

An experimental trial consisted of a warning tone, which was 
followed by 250 ms of silence. Then, the items were presented, 
after which participants had 1500 ms to decide whether the item 
they heard had a Itl present or not, before a new trial began. 
Participants were asked to press the respective response button 
("t" or "no t") on a response box with their dominant index 
finger. Response measurements were conducted with help of a 
custom-made software and hardware combination (Reetz and 



Table 1 | Stimuli in the experimental conditions. 


Experimental condition Item-type 


CVC-structure 


Example in IPA 


Mean duration 


No. of stimuli 


+t t-items 

ts-items 


VtV 

V(C)tsV 


[arte] 
[i:stsi] 


262 ms 
423 ms 


45 
45 


— t s-items 

n-items 


VsV 
VCV 


[i:se] 
[ane] 


320 ms 
256 ms 


30 
30 


0t 0t-items 


Vst-(C)V 


[i:se] 


392 ms 


30 



SUM 332 ms 180 
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Kleinmann, 2003) where response boxes (with two responses 
buttons labeled "t" and "no t") were connected to an external 
device and subsequently saved onto an Apple Mac Book Pro. 
Participants were tested wearing Sennheiser eH-350 headphones. 

Three experimental lists were created with the 180 items. Each 
list was pseudo-randomized and between the experimental lists, 
participants could take a break. Overall, they decided 540 times 
whether a fragment had a III present or not. Participants were 
tested in groups of four or less. They received written instructions 
before the experiments. They were asked to respond as quickly 
and as accurately as possible, whether the fragment they heard 
had a "t-sound" present or not. Participants received a training 
section with 14 items that were not part of the experiment. The 
training section preceded the actual experiment to familiarize the 
participants with the task and the procedure of the experiment. 

Accuracy rates were calculated as percent correct responses 
for the respective category (after exclusion of no responses and 
responses that were too fast or too slow). 

PREDICTIONS 

• If the It/ was indeed completely deleted (0t-items), partici- 
pants should treat these items as they would treat items where 
no III was present (—t-items). However, based on the results 
reported by Mitterer and colleagues (Mitterer and Ernestus, 
2006; Mitterer and Tuinman, 2012) and based on the recon- 
struction of other segments, listeners may be able to use the 
cues that are left in the speech signal to reconstruct III in some 
cases, and we expect some qualitative difference compared to 
items where no III is present (—t-items). We do not expect 
reconstruction rates that are identical to underlying items (— t- 
or +t-items). 

• For the items where no III was present or where there was an 
underlying III (i.e., —items and +t-items), we expect very high 
accuracy rates, because the task itself is rather straightforward 
and it should not lead to high error rates. 

RESULTS 

In a first analysis, we investigated how accurate participants were 
responding to the fragments they had to listen to. Each of the par- 
ticipants was expected to respond to 540 fragments. Therefore, 
7560 responses were expected (3 repetitions x 180 items x 14 
listeners). Before examining the reconstruction of III, it is impor- 
tant to see how participants responded to clear cases where a III 
was present or where no III was present (i.e., the +t- and —t- 
items). The responses to +t-items and —t-items were also used 
to potentially exclude participants that showed a high error rate 
in these response categories. For the two underlying categories, 
6300 responses were expected (3 repetitions x (90 + 60) items x 
14 listeners). Of these, there were 64 cases where no response was 
given which were excluded from further analysis. Furthermore, 
117 cases of responses that were faster than 150 ms and slower 
than 900 ms were also excluded. Wrong responses occurred when 
participants pressed "t" for —t-items or "no t" for +t-items (217 
responses — 20 s-items, 19 n-items, 22 t-items, 156 ts-items). For 
the statistical analysis, errors were coded with "0" and correct 
responses with "1." Table 2 gives an overview over the accuracy 
rates [Least Square Means (LSM) as well as Means] for the +t- 



and —t-items. Accuracy rates for individual participants ranged 
between 99.3 and 90.7%; no participant was excluded. For the 
analysis, a linear mixed model was calculated with IMP (SAS, 
2012), with ITEM and PARTICIPANT as random factors, item- 
condition (t-item, ts-item, s-item, n-item) as factor, and accuracy 
as dependent variable. Results indicate that ITEM-CONDITIONS 
differed significantly [P(3, 144.2) = 30.47; p < 0.0001]. This was 
driven by the lower accuracy rate for ts-items that were signif- 
icantly different from all other ITEM-CONDITIONS, which did 
not differ significantly, as was indicated by a Tukey HSD post-hoc 
test. 

For the next analysis, that is, the comparison with 0t-items, 
there was no difference made between t-items and ts-items (i.e., 
the +t-items) on the one hand and s-items and n-items (i.e., 
the — t-items) on the other hand. 

In a next step, response categories for 0t-items were com- 
pared to responses to the two underlying item categories. For 
this analysis, cases where no response was given (91 responses) 
or responses that were too fast or too slow (154 responses) were 
excluded from further analysis. 0t-items were responded to with 
"t" in 45% of the time (541 responses), whereas they received a 
"no t" response in 655 cases (55%). For statistical analysis, we 
treated "no t" responses to 0t-items as "correct" to be able to 
compare the results with the underlying items in a linear mixed 
model. Figure 1 shows the responses given to the respective cate- 
gories (+f-, — f- and 0t-items), whereas Table 3 reports the LSM 
for the respective categories. The linear mixed model with ITEM 



Table 2 | Least Square Means (LSM) and Means of Accuracy for the 
underlying items 3 . 



Experimental 


Item-type 


LSM 


SE 


Mean 


SE 


condition 




accuracy 








+t 


t-items 


0.988 


0.009 


0.988 


0.004 




ts-items 


0.913 


0.009 


0.913 


0.004 


-t 


s-items 


0.983 


0.01 


0.984 


0.005 




n-items 


0.985 


0.01 


0.985 


0.005 



"Accuracy was calculated as percentage of correct responses to the items of 
the respective category. 




+t-items -t-items 0t-items 



FIGURE 1 | Percent of different responses to the respective categories, 
"no t" responses are in blue, while "t" responses are depicted in red. 
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Table 3 | Least Square Means (LSM) of Accuracy for different item 
categories. 



Experimental condition 


LSM accuracy 


SE 


+t 


0.95 


0.013 


-t 


0.98 


0.015 


0t 


0.55 


0.02 



and PARTICIPANT as random factors, ITEM-CONDITION (+t, —t, 
0t) as factor and ACCURACY ("0" for incorrect, "1" for correct) as 
dependent variable showed that ITEM-CONDITION was a signifi- 
cantfactor [%, 176.4) = 223.84;p < 0.0001], and that each of the 
three conditions differed significantly. This means that deleted Itl 
was reconstructed in a little less than half of the possible cases. 

As indicated in the Materials and Method section, the number 
of stimuli was not evenly distributed for the three conditions (i.e., 
90 +t-items, 60 —t-items and 30 0t-items). This could have led 
participants to create a response bias. Therefore, we also used d' 
as a way to account for possible response biases (e.g., Stanislaw 
and Todorov, 1999; Macmillan and Creelman, 2005) 6 . The d' 
values were analyzed to compare sensitivity differences for the 
different items. A linear mixed model was calculated with PAR- 
TICIPANT as random factor and comparison as fixed factor, as 
well as d, as dependent variable. Results indicate that COMPAR- 
ISON is a significant factor [F( 2 , 26) = 74.9;p < 0.0001]. A Tukey 
HSD test showed that all comparisons were different from each 
other. Table 4 indicates the LMS for the different comparisons in 
this analysis. The d' analysis thus shows the same tendency as the 
accuracy ratio analysis. 

A closer inspection of the different items showed that there was 
considerable variation between individual 0t-items. The amount 
of "no t" responses ranged from 95 to 15.4%. Figure 2 shows the 
percent "no t" responses for each of the 0t-items. 

In a next step, we analyzed whether participants showed dif- 
ferent responses to the items in the course of the experiment, 
that is, whether they changed their reconstruction behavior over 
time. To this end, we calculated a linear mixed model with 
ITEM and PARTICIPANT as random factors, ITEM-CONDITION 
(+f, — t, 0t), PRESENTATION (first, second, third) and the inter- 
action of ITEM-CONDITION and PRESENTATION as fixed factors 
and ACCURACY ("0" for incorrect, "1" for correct) as depen- 
dent variable. The results show that ITEM-CONDITION was a 
significant factor [F( 2 , 176.5) = 22.94, p < 0.0001], as was PRE- 
SENTATION [P( 2 , 7118) = 3.82, p < 0.05]. The interaction ITEM- 
CONDITION x PRESENTATION turned out to be significant as 
well [f(4, 7117) = 8.07, p < 0.0001]. In this model, concerning 
ITEM-CONDITION, 0t-items differed from the other two item 
groups, but +t-items and —t-items items did not differ signifi- 
cantly from each other. When we look at the factor presentation, 
the first PRESENTATION had an overall accuracy rate of 83.6%, 
the second presentation was responded to correctly in 83% of the 
cases, whereas in the third presentation, participants responded 



°In cases where participants had an accuracy rate of 100% (this occurred 
for —t-items only, we set the hit rate to 0.99 by adding one miss to the 
performance — this occurred for 7 of the participants). 



Table 4 | Least Square Means (LSM) of d' in the respective 
comparisons. 



Experimental condition 


LSM accuracy (d ) 


SE 


-t vs. +t 


3.98 


0.17 


-tvs. 0t 


2.39 


0.17 


0t vs. +t 


1.87 


0.17 



correctly in 81.5% of the cases (See Figure 3). A Tukey HSD 
post-hoc test showed that the third condition was significantly 
different from the first one, but the first presentation was not dif- 
ferent from the second, nor was the second from the third. Finally, 
the significant interaction of ITEM-CONDITION x PRESENTA- 
TION was driven by the fact that the 0t-items showed different 
accuracy rates in the three presentations. Participants responded 
with "no t" to 0t-items in 59.1% during the first presentation, 
54.9% during the second presentation and 50.2% during the third 
one. The Tukey HSD post-hoc analysis indicates that the first and 
third presentations of 0t-items were different from each other, but 
the first and second were not, nor were the second and the third. 
The +t-items and the —t-items did not show significant differ- 
ences in the three presentations, but were significantly different 
from all 0t-item presentations. 

A final investigation analyzed a possible correlation between 
the reconstruction of Itl and the s-length of the 0t-items. The cor- 
relation that was found was significant but not very strong [r 2 2 ^ = 
0.03; p < 0.0001]. The analysis showed that the longer the Is/ in 
the signal, the more likely were participants to reconstruct Itl. 

DISCUSSION AND CONCLUSIONS 

The first research question we investigated in this article con- 
cerns the extent to which German listeners are able to reconstruct 
seemingly deleted Itl. In German, about 20% of final /t/s get 
deleted (e.g., Zimmerer, 2009; Zimmerer et al., 2011, 2014). If 
German listeners behaved similar to Dutch listeners, reconstruc- 
tion should be frequent, but it should also not occur in every 
instance. 

The results from a forced choice phoneme detection task indi- 
cate that listeners are at least sometimes able to do reconstruct 
deleted Itl. When they were faced with fragments from speech 
with a deleted Itl, reconstruction occurred in about 45% of the 
cases. The amount of Itl responses for the 0t-items is clearly dif- 
ferent from both +t-items (with more than 95% "t" responses), 
as well as from —t-items (with less than 5% "t" responses). This 
finding is also supported by the analysis of d, where 0t-items fell 
in between the clear cases of — t- and +t-items. 

Thus, compared to Dutch listeners, Germans seem to recon- 
struct deleted Itl less often (cf. Mitterer and Ernestus, 2006; 
Mitterer and Tuinman, 2012). One explanation for this differ- 
ence is the choice of stimuli in the experiment reported here, 
which are different from the experiments reported by Mitterer 
and colleagues. Mitterer and Ernestus used synthetically manipu- 
lated stimuli in sentential contexts (as did Mitterer and Tuinman), 
whereas in this study, only fragments from verbs and pronouns, 
that is, parts of real speech were used. These fragments were pre- 
sented without context and were not word-like. Therefore, the 
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FIGURE 2 | Percent "no t" responses to the different 0t-items. 



FIGURE 3 | Percent correct responses ("no t" for 0t and —t-items; "t" 
for +t-items) for the three presentations. 



stimuli in this study were on the one hand arguably closer to 
the kind of speech that is encountered by listeners in natural sit- 
uations, since no additional manipulation (e.g., synthesis) was 
performed. On the other hand, they were also less natural, since 
usually, speech never occurs without context. Furthermore, the 
stimuli used here allowed for less control over what cues speak- 
ers could have left for deleted Itl (see below). A third explanation 
for the different results could be that the stimuli here were not 
word-like, and thus, prevented listeners to adjust their perception 
by reconstructing a Itl. German listeners were shown to rely more 
on higher-level lexical knowledge when reconstructing Itl in their 
foreign language Dutch (Mitterer and Tuinman, 2012). Also, in 
"normal" speech, German verbs are produced with a pronoun. 
That pronoun additionally helps to perceive the correct word and 
to activate the intended meaning. Therefore, this study can be 
seen to indirectly show further evidence for the importance of 



context when listeners have to deal with deletions. If context is 
missing, listeners have been shown to need additional effort for 
successful recognition (e.g., Pickett and Pollack, 1963; Pollack and 
Pickett, 1963; Ernestus and Baayen, 2007; Zimmerer, 2009; van 
de Ven et al, 2011; Zimmerer et al., 2012). To some extent, the 
lower reconstruction rates of German listeners could be partly due 
to language-specific behavior, too. Production analyses revealed 
a slightly higher amount of Itl deletion in Dutch compared to 
German. Therefore, Dutch listeners are faced with the deletion of 
Itl more often and therefore might have a reconstruction strategy 
that is based more on phonetic cues (such as I si length or senten- 
tial context), whereas German listeners are more focused on the 
lexical information for reconstructing Itl. 

This interpretation is also connected to the second research 
question we addressed in this article, concerning the impact of 
length of Is/ preceding a deleted Itl on the reconstruction of It! 
(which seems to be language-specific). Dutch listeners have been 
shown to use the fine phonetic details of Is/ length as cues for 
reconstruction (e.g., Mitterer and Ernestus, 2006; Mitterer et al., 
2008), but results from production analyses of German speak- 
ers indicate that German speakers behave differently from Dutch 
speakers, therefore, reconstruction strategies might also be dif- 
ferent for German and Dutch listeners. The results of this study 
suggest that despite the consistent production of Is/ lengthening 
by German speakers, there seems to be only a small, but signifi- 
cant correlation between Is/ length and Itl reconstruction. On the 
one hand, this finding can be seen as another argument in favor 
of language-specific reconstruction strategies of German listeners 
when encountering deleted final Itl. At the same time, the rather 
small correlation indicates that listeners are not focusing solely on 
Is/ length when they reconstruct Itl. The rather small effect of Is/ 
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length could be explained partly by the nature of the stimuli (i.e., 
fragments without or with only minimal context). Another possi- 
ble explanation is the number of stimuli with deleted It/, which is 
quite small. The stimuli were all excised from the corpus without 
any consideration to I si length. Despite the consistent lengthen- 
ing of I si in case of It/ deletion, there is also overlap between 
the length of I si in cases where It/ is deleted and where It/ is not 
deleted (cf. Zimmerer et al, 2011, 2014). Therefore, listeners can- 
not be completely sure about the nature of the I si in the stimuli 
they encounter in the experiment. Furthermore, the fragments 
were rather short and speech rate (which may play an impor- 
tant role for the perception of I si length) cannot be estimated 
with high confidence by listeners, which might prevent them from 
using the cue of I si length consistently. 

One result of the experiment that could lead to further research 
in the future concerns the differences for the underlying cate- 
gories with respect to accuracy. The ts-items were more difficult 
for the participants to respond to correctly than the t-items, 
but apart from these, there was no difference between the other 
underlying categories. A possible explanation is that It/ may be 
acoustically less salient when preceding I si. This point is actually 
one of the explanations why It/ gets deleted in such contexts in 
the first place. The ts-items were similar to 0t-items concerning 
the segmental structure, and if deletion is regarded as extreme 
form of reduction, maybe some of the ts-items were reduced to 
some extent. However, we also cannot rule out an orthographic 
influence, because in German orthography, [ts] can be written in 
some cases as "ts" or as "z" in others. Therefore, a transfer to the 
decision "t" or "no t" may be additionally difficult in these cases. 
Listeners could have treated these items as underlying affricates 
and thus refrained from responding with "t." Especially this last 
possibility should be investigated in future research. It would be 
interesting to find out, whether listeners treat sequences of the 
segments [t] and [s] which are parts of the fragments across word 
boundaries the same or differently from underlying affricates [ts], 
in words like Mtitze ("cap") or Herz ("heart"), which could also be 
tested for the influence of orthography, because in some cases the 
"t" is written, in others not. 

An effect that also emerged in the results was that participants 
responded to 0t-items more often with "t" in the third presenta- 
tion compared to the first one. Possibly, this shows that listeners 
were able to learn during the experiment and focus on pho- 
netic cues that were left in the speech signal. Despite the overall 
tendency to reconstruct deleted It/ more often later in the exper- 
iment, not all items showed this trend. Individual items showed 
very different patterns across the three conditions some had also 
more "t" responses in the first presentation compared to the last 
one. The very different patterns of reconstruction and the finding 
that, overall, more deleted /t/s were reconstructed during the third 
presentation compared to the first one may also indicate that the 
overall asymmetry was not a decisive factor. If this was the case, 
we would have found more likely the reverse pattern. Because 
the number of +t-items is higher than —t-items, participants may 
have a "no t" bias in their response to press each button equally 
often. If listeners had the expectation that they should give an 
equal number of "t" and "no t" responses, we would expect even 
more "no t" responses during the third presentation, because the 



asymmetry would have built up an increased response bias. This 
was not the case, however. Taken all presentations together, It/ was 
not reconstructed in the majority of the cases by the participants. 

The fact that listeners are not able to restore It/ consistently is 
also shown by the varying success in It/ restoration for individ- 
ual stimuli, showing a range between 15 and 95%. No item was 
always (or never) restored and the variation between the items is 
considerable. At this point, we can only speculate about the cues 
that were left in the items which were leading to a more successful 
reconstruction compared to other cues, because I si length does 
not seem to be capturing the whole picture. Despite the signifi- 
cant correlation, it is extremely week. This finding also leads to 
a possible extension of the experiment for further research. The 
choice of stimuli in this experiment was based on a pretest, where 
only stimuli were chosen which were never responded to with a It/ 
sound. However, since reduction has been shown to be gradient, it 
could be interesting to include also items which were transcribed 
with It/ in some of the cases in the pretest. Arguably, in these 
items, more cues for It/ were left in the speech signal. The inclu- 
sion of such items could shed more light on the question what 
exactly the cues are and could also be used to investigate gradient 
activation (cf. Janse et al., 2007). 

A question that additionally could arise with respect to the 
pretest is the difference concerning the amount of reconstruc- 
tion in the pretest and in the phoneme detection experiment. At 
this point, we can offer only speculative explanations. First, the 
number of participants in the pretest was very small. Therefore, 
it could be possible that with more transcribers, more "t" tran- 
scriptions could have occurred for the items we chose as 0-items. 
Furthermore, the participants for the pretest listened to each 
stimulus only once, and they were completely unaware of the pur- 
pose of their transcriptions and therefore did not pay attention to 
It/. In the forced-choice experiment, however, participants con- 
centrated on the presence of It/. They had to make the decision, 
whether due to some phonetic detail, there could have been a It/ 
present in the stimulus, and they had to do so under some time- 
pressure. In some cases, they arguably made a "random" decision 
(nearly half of the 30 0t-items had accuracy rates in the range 
of 40-60%, cf. Figure 2). This attention to fine phonetic detail 
to come to a decision may also result in an overall increase in 
the likelihood to reconstruct it/. And because participants were 
told that there is either a "t" sound present (or not), they had to 
decide spontaneously; there was no third option. Actually, in this 
respect, the rather unnatural task ("press a button for "no t" or 
"t") might be even somewhat close to natural speech situation, 
because in these, especially, when possible ambiguities are created 
by the deletion of segments, listeners also have to decide (under 
time pressure, because conversation goes on), whether there was a 
It/ present or not, and they might use all the cues they can to come 
to that decision (including sentential context, of course, which 
was absent here). At this point, we cannot be sure, which of the 
explanations is most accurate and some of the arguments might 
seem speculative, but this question is also an interesting method- 
ological field for further research. Mitterer and colleagues cir- 
cumvented many of possible problems with the stimuli by using 
resynthesized stimuli where it is possible to control very tightly 
the acoustics of what is presented. The use of non-manipulated 
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stimuli in our experiment does not allow for such a control. At 
the same time, non-manipulated stimuli might be closer to what 
listeners are faced with in natural occurring speech. For future 
research, it may be important to use both natural stimuli excised 
from corpora with stimuli that were artificially resynthesized. 
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